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A PFA has been developed for the SiD detector concept at a future Linear Collider. 
The performance of the version of this PFA used in the SiD LOI is presented for a 
number of physics processes with two hadronic jets. Presented at LCWS08 [I]. 

1 Introduction 

Reconstruction in SiD is based on the particle flow concept in which calorimeter energy 
deposits from individual particles are separated, allowing the energy of each to be measured 
in the optimal subsystem for that particle (the silicon tracker for charged particles, the 
EM calorimeter for photons, both calorimeters for neutral hadrons). In the limit of perfect 
separation, the contribution to the jet energy resolution from charged particles is negligible 
and only neutral hadrons need to have their energy measured in the hadronic calorimeter, 
leading to a jet energy resolution of roug hljQ 20%/^ 0- In practice, this limit is difficult 
to achieve. Degradation of the resolution due to imperfect separation of energy deposits is 
generically referred to as confusion, and is the most important effect for well-contained, high- 
energy jets in the SiD acceptance. A particle flow algorithm (PFA) has been developed and 
tuned for SiD in the org . lesim software framework with the goal of minimizing the confusion 
and therefore the resolution. A snapshot of the PFA has been used for the analysis and 
benchmarking results reported in this LOI; development is still in progress and performance 
is expected to continue improving in future versions. 

A deliberate effort has been made to keep the code as modular as possible. Different 
components communicate with one another by reading and writing named objects in stan- 
dard formats to the event-level data store. This makes the flow of information clear, and 
allows one component to be substituted for another. 

For each event, the SiD PFA takes as inputs the energy deposits in the calorimeters 
and muon system and the set of tracks found in the tracking system (as described in Refer- 
ence [3]). The PFA then performs the reconstruction in a series of steps, described in detail 
in Reference [4]. The general strategy for pattern-recognition in the calorimeters is (a) to 
identify and set aside the easiest, most distinctive showers first, taking maximum advantage 
of the information, and (b) to recognize common classes of mistakes made earlier in the 
algorithm and correct for them. The PFA produces as output a collection of reconstructed 
particles suitable for use in a physics analysis. 



2 Performance 



The true test of performance is the sensitivity to key physics observables — this is discussed 
in Reference [5]. However, for the purposes of studying and optimizing a PFA it is helpful 
to look at specific physics processes which are simple to analyse and depend primarily on 
the quality of the PFA output. We use two such processes: 

a E is in units of GeV throughout. 



LCWS/ILC2008 



S 200- 
> z 
g 180- 

Sl60 r 

1 140 j 

I 120^ 

LU 

100^ 

80 E- 




u :-1.24GeV 
rms m : 4.25 GeV 



10 20 30 40 
Dijet mass residual (GeV/c 2 ) 



■a 100- 
> 

a 

S 80 - 



u : -2.53 GeV 
rms m : 3.50 GeV 



10 20 30 40 
Dijet mass residual (GeV/c 2 ) 



Figure 1: Dijet mass residuals for e + e — ► Z(qq)Z(vv) events at yfs = 500 GeV, shown for 
the barrel (left) and endcap (right) regions of polar angle. 



• e + e~ — > qq at yfs = 100,200,360,500 GeV, for q = u,d,s. Bcamstrahlung and 
bremstrahlung in the initial state arc disabled so that the collision energy Ecm is the 
same as yfs. The figure of merit is the event energy sum residual AEqm, i-e. the signed 
difference between the reconstructed and true values of Ecm- Under the simplifying 
assumption that the invariant mass of two jets with energies E\ and E2 and opening 
angle #12 is given by mf 2 = 2EiE2(l — cos 6*12), the resolution of energy sum residuals 
is equal to the resolution of the dijet mass for jets of the same energy. 

• e + e" — > Z{qq)Z(yv) at y/s = 500 GeV, for q = u,d, s. The figure of merit is the dijet 
mass residual AM, the signed difference between the reconstructed and true values of 
m q q. Plots of the residual distribution are shown in Figure [U 

In both cases, the figure of merit depends upon the quality of hadronic jet reconstruction 
but does not require jet-finding or corrections for primary neutrinos. 

Table Q] shows the measured resolutions^ for the sid02 detector. The resolution is quoted 
separately for the barrel (0 < | cos(0)| < 0.8) and endcap (0.8 < | cos(0)| < 0.95) regions of 
polar angle. There are several effects at work: 

• The calorimetric component of the resolution function is expected to scale as \/~E, 
i.e. slower than linear. When this dominates, the fractional resolution (<tab cm /£^cm) 
decreases as the energy goes up. 

• The confusion component of the resolution function will increase as the jet energy goes 
up and pattern-recognition becomes harder. The encrgy-dcpcndcncc is not known from 
first principles, but it is likely to be at least linear if not faster. 

• At high energies, leakage of energy out of the back of the calorimeter becomes im- 
portant. The impact on the resolution has a strong angular dependence, since the 
effective depth of the calorimeter varies with cos#; this is illustrated in Figure [2l This 
effect is modest for jet energies of 180 GeV but becomes dominant by 250 GeV. It is 
partially mitigated in the endcap region by using the muon system as a tail-catcher; 

b Resolutions are quoted in terms of rmsgo , the RMS of the contiguous block of 90% of events with smallest 
RMS. Similarly, /igo is defined to be the mean of these events. Note that for a Gaussian distribution, the 
rmsgo is approximately 78% of the full RMS. 
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Table 1: PFA performance for sid.02. For the e + e~ — ► qq processes, the rmsgo of the energy 
sum residuals is quoted as a fraction of yfs, and for the e + e~ — > ZZ process the rmsgo of 
the dijet mass residuals is quoted as a fraction of mz- Resolutions are quoted for the LOI 
production snapshot and do not include subsequent improvements. 

this depends strongly on the longitudinal segmentation in the muon system and was 
found to be much more effective with an absorber thickness of 5 cm than 20 cm. 

• For the ZZ events, the requirement that both jets lie in the angular region of interest 
constrains the kinematics of the decay. 

• The dijet mass resolution measured in e + e~ — > ZZ events is observed to be larger 
than the resolution seen in e + e~ — > qq events, even when the jet energy is comparable. 
This may be due to non-linearity in the energy response: the e+er — > qq events have 
mono-energetic jets by construction and so a non-linear response would simply shift 
the mean of the energy sum distribution, whereas the jets in e + e~ — > ZZ events 
can be quite asymmetric and therefore the dijet mass residual distribution would be 
broadened by such an effect. 

The resolutions in Table [1] are larger than those seen when running the PandoraPFA 
algorithm on the ILD detector design [6]. Understanding this difference is not straight- 
forward: the performance of a PFA and the design of the detector on which it runs are 
coupled and it is not meaningful to take either in isolation. It is also technically very dif- 
ficult to run one PFA on the other detector. However, a work-around has been developed: 
by starting from the LDCOOSc detector and adjusting the calorimeter geometry and layer- 
ing, we can produce "SiDish" detectors which have similar dimensions to sid.02 and run 
PandoraPFA v2.01@ on them [7]. Note that the SiDish detectors still use the same detector 
technology as LDCOOSc, though: a TPC tracker and iron/scintillator HCAL, unlike SiD's 
silicon tracker and iron/RPC HCAL. 

The e+e" — > qq event energy sum resolution in 0.0 < | cos#| < 0.7 found when running 
PandoraPFA on SiDish detectors resembling sid02 is 3.1% for yfs = 90 GeV and 2.8% for 
yfs = 200 GeV, superior to the performance we find in Table[T](3.7% and 3.0%, respectively). 
Part of this difference is due to the difference between sid02 and the SiDish detectors. In 
previous studies comparing SiD detectors with scintillator and RPC instrumentation of the 
HCAL, we found that the scintillator variant had better performance by about 10% relative 
(0.3% absolute). Likewise, the use of a TPC tracker gives more complete information for 
decays and interactions inside the tracking system (e.g. for Kg — > 7r + 7r~); we can place an 

c PandoraPFA has continued to develop in the meantime; the current version is v03-/3. 
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upper bound on this of 0.3% for V« = 100 GeV and 0.2% for y/s = 200 GeV from studies 
with cheat tracking. These effects are sufficient to explain most of the observed performance 
difference between PandoraPFA and the SiD PFA. 

3 Conclusions 

There has been a great deal of progress in 
SiD reconstruction since ALCPG07. We 
have switched to full track reconstruc- 
tion and found that PFA performance in 
e + e~ — * qq events remains close to that of 
cheat tracking. The PFA itself has been 
largely rewritten and gives event energy 
sum resolutions of order 3.0-3.5% for jet 
energies up to 250 GeV. The PFA perfor- 
mance was found to be approaching that of 
the gold standard, PandoraPFA, when run- 
ning on a comparable detector design for jet 
energies up to 200 GeV. This is very encour- 
aging for the jet physics prospects at SiD. 

Nonetheless, there is a great deal of im- 
provement still to come. A number of code 
fixes have already been made [4] and more 
substantial revisions such as the integra- 
tion of calorimeter-assisted tracking [8] are 
planned. At the broadest level, the two 
principal challenges are: (1) to understand 
the impact of leakage in high-energy jets on 
the physics potential of the detector, and to 
reduce it by adapting the algorithm and de- 
tector design if needed; and (2) to improve 
the reconstruction algorithm, and in partic- 
ular to reduce the dijet mass resolution seen 
in e + e _ — ► ZZ events. The confusion term still dominates the resolution for the range of 
jet energies likely to be used in physics analyses at a 0.5 TeV or even 1 TcV collider: we 
have plenty of room for improvement. 
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Figure 2: Resolution (rmsgo of the energy sum 
residuals) as a function of angle for e + e~ — > qq 
for y/s = 200,500 GeV. For 200 GeV (solid), 
leakage is not significant and the angular dis- 
tribution is roughly flat, rising slowly towards 
| cos 6*| = 0.975 as the effects of acceptance 
and tracking become important. For 500 GeV 
(dashed), leakage has a major impact — this 
can be seen from how the resolution varies 
in the barrel between | cos 6*| = where the 
calorimeter thickness is minimized to | cos#| = 
0.8 where it is greatest. For both energies, the 
resolution is very bad for | cos6*| > 0.975 due 
to acceptance losses. 
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